Demonstrating quantum speed-up in a superconducting two-qubit processor 
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We operate a superconducting quantum processor consisting of two tunable transmon qubits 
coupled by a swapping interaction, and equipped with non destructive single-shot readout of the 
two qubits. With this processor, we run the Grover search algorithm among four objects and 
find that the correct answer is retrieved after a single run with a success probability between 0.52 
and 0.67, significantly larger than the 0.25 achieved with a classical algorithm. This constitutes a 
proof-of-concept for the quantum speed-up of electrical quantum processors. 



The proposition of quantum algorithms |TH3| that per- 
form useful computational tasks more efficiently than 
classical algorithms has motivated the realization of phys- 
ical systems [3] able to implement them and to demon- 
strate quantum speed-up. The versatility and the poten- 
tial scalability of electrical circuits make them very ap- 
pealing for implementing a quantum processor built as 
sketched in Fig. [T| Ideally, a quantum processor consists 
of a scalable set of quantum bits that can be efficiently 
reset, that can follow any unitary evolution needed by an 
algorithm using a universal set of single and two qubit 
gates, and that can be read projectively |S]. The non- 
unitary projective readout operations can be performed 
at various stages of an algorithm, and in any case at the 
end in order to get the final outcome. Quantum proces- 
sors based on superconducting qubits have already been 
operated, but they fail to meet the above criteria in differ- 
ent aspects. With the transmon qubit 0[7] derived from 
the Cooper pair box [8], two simple quantum algorithms, 
namely the Deutsch-Jozsa algorithm |IJ] and the Grover 
search algorithm [T], were demonstrated in a two qubit 
processor with the coupling between the qubits mediated 
by a cavity also used for readout |10| . In this circuit, 
the qubits are not read independently, but the value of 
a single collective variable is determined from the cavity 
transmission measured over a large number of repeated 
sequences. By applying suitable qubit rotations prior to 
this measurement, the density matrix of the two-qubit 
register was inferred at different steps of the algorithm, 
and found in good agreement with the predicted one. 
Demonstrating quantum speed-up is however more de- 
manding than measuring a collective qubit variable since 
it requests to obtain an outcome after a single run, i.e. 
to perform the single-shot readout of the qubit register. 
Up to now, single-shot readout in superconducting pro- 
cessors has been achieved only for phase qubits [111 IT2"] . 
In a two phase-qubit processor equipped with single-shot 
but destructive readout of the qubits, the Deutsch-Jozsa 
algorithm [5] was demonstrated in Ref. [TT] with a success 
probability of order 0.7 in a single run, to be compared 
to 0.5 for a classical algorithm. 




Figure 1: Schematic blueprint of a quantum processor based 
on quantum gates, represented here in the two-qubit case rel- 
evant for our experiment. A quantum processor consists of a 
qubit register that can perform any unitary evolution needed 
by an algorithm under the effect of a universal set of quantum 
gates (single qubit gate Ui , two-qubit gate 1/2)- Ideally, all 
the qubits may be read projectively, and may be reset. 



Since the Deutsch-Jozsa classification algorithm is not 
directly related to any practical situation, demonstrat- 
ing quantum speed-up for more useful algorithms in an 
electrical processor designed along the blueprint of Fig. [T] 
is an important goal. In this work, we operate a new 
two transmon-qubit processor [13] that comes closer to 
the ideal scheme than those previously mentioned, and 
we run the Grover search algorithm among four objects. 
Since, in this case, the algorithm ideally yields the answer 
after one algorithm step, its success probability after a 
single run provides a simple benchmark. We find that 
our processor yields the correct answer at each run with 
a success probability that ranges between 0.52 and 0.67, 
whereas a single step classical algorithm using a random 
query would yield the correct answer with probability 
0.25. 

The scheme and the operation mode of our processor is 
shown in Fig.[2j Two tunable transmon qubits coupled by 
a fixed capacitor, are embedded in two identical control 
and readout sub-circuits. The Hamiltonian of the two 
qubits {1,11} is H/h = (-vial - una 1 } + 2ga l y af) /2, 
where cr x ,y,z are the Pauli operators, vi u the qubit fre- 
quency controlled by the flux applied to each transmon 
SQUID loop with a fast (0.5 GHz bandwidth) local cur- 
rent line, and g — 4.6MHz <C i/[ 11 the coupling frequency 
controlled by the coupling capacitance. The achieved fre- 
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Figure 2: Electrical scheme of the two qubit circuit operated 
and typical sequence during processor operation, (a) Two ca- 
pacitively coupled transmon qubits (green) have tunable fre- 
quencies controlled by the flux induced in their SQUID loop 
by a local current line (in red). The coupling capacitance 
(in magenta) yields a swapping evolution between the qubits 
when on resonance. Each transmon is embedded in a non- 
linear resonator used for single-shot readout. Each reflected 
readout pulse is routed to a cryogenic amplifier through cir- 
culators, homodyned at room temperature and acquired dig- 
itally, which yields a two-bit outcome, (b) Typical operation 
of the processor showing the resonant microwave pulses a(t) 
applied to the qubits (green) and to the readouts (blue), on 
top of the DC pulses (red lines) that vary the transition fre- 
quencies of qubit I (solid) and II (dashed). With the qubits 
tuned at a first working point for single qubit gates, resonant 
pulses are applied for performing X and Y rotations, as well 
as small flux pulses for Z rotations; qubits are then moved to 
the interaction point for two-qubit gate operations. Such se- 
quences can be combined as needed by the algorithm. Qubits 
are then moved to their initial working points for applying 
tomography pulses as well as a |1) — ► |2) pulse Xi2(7r) to in- 
crease the fidelity of the forthcoming readout. Finally, they 
are moved to better readout points and read. 



quency control allows us to place the two transmons on 
resonance during times precise enough for performing the 
universal two-qubit gate ViSWAP [13] and the exchange 
gate iSWAP used in this work. The qubit frequencies 
are tuned to different values for single qubit manipula- 
tion, two-qubit gate operation, and readout. The readout 
is independently and simultaneously performed for each 
qubit using the single-shot method of Ref . [T4j . It is based 
on the dynamical transition of a non-linear resonator 
|15l 116] that maps the quantum state of each transmon 
to the bifurcated/non bifurcated state of its resonator, 



which yields a binary outcome for each qubit. This read- 
out method is potentially non-destructive, but its non- 
destructive character is presently limited by relaxation 
during the readout pulse. In order to further improve 
the readout fidelity, we resort to a shelving method that 
exploits the second excited state of the transmon. For 
this purpose, a microwave pulse that induces a transition 
from the state |1) towards the second excited state |2) 
of the transmon is applied just before the readout pulse 
as demonstrated in Ref. [14 . This variant does not alter 
the non-destructive aspect of the readout method since 
an extra pulse bringing state \2) back to state |1) could 
be applied after readout. Although the readout contrast 
achieved with this shelving method and with optimized 
microwave pulses reaches 0.88 and 0.89 for the two qubits 
respectively, the values achieved at working points suit- 
able for processor operation are lower and equal to 0.84 
and 0.83. The readout outcome probabilities for all input 
states of the two-qubit register are given in the Supple- 
mentary Information S4, with a discussion of the error 
sources. 

In order to characterize the evolution of the quan- 
tum register during the algorithm, we determine its den- 
sity matrix by state tomography. For this purpose, we 
measure the expectation values of the extended Pauli 
set of operators {u x I, ..,a z cr z } by applying the suitable 
rotations just before readout and by averaging typi- 
cally 10 4 times. Note that the readout errors, which 
can be well-characterized, are corrected when determin- 
ing the expectation value of the Pauli set, and thus 
do not contribute to tomography errors as explained in 
Ref. [T3] • The density matrix p is then taken as the ac- 
ceptable positive-semidefinite matrix that, according to 
the Hilbert-Schmidt distance, is the closest to the possi- 
bly non physical one derived from the measurement set. 
In order to characterize the fidelity of the algorithm at 
all steps, we use the state fidelity F = (ip \p\ i/j) with \ip) 
the ideal quantum state at the step considered; F is in 
this case the probability for the qubit register to be in 
state |?/>). 

The Grover search algorithm [T] consists in retrieving 
a particular basis state in a Hilbert space of size N using 
a function able to discriminate it from the other ones. 
This function is used to build an oracle operator that 
tags the searched state. Starting from the superposition 
\<p) of all register states, a unitary sequence that incor- 
porates the oracle operator is repeated about \/N times, 
and eventually yields the searched state with a high prob- 
ability. The implementation of Grover 's algorithm in a 
two-qubit Hilbert space often proceeds in a simpler way 
|17fl2"2"] since the result is obtained with certainty after 
a single algorithm step. The algorithm then consists of 
an encoding sequence depending on the searched state, 
followed by a universal decoding sequence that retrieves 
it. Grover's algorithm thus provides a simple benchmark 
for two-qubit processors. Its implementation with our 
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quantum processor is shown in Fig. [3ja). First, the su- 
perposed state \(j>) is obtained by applying n/2 rotations 
around the Y axis for the two qubits. The oracle opera- 
tor O uv tagging the two-qubit state \uv) = \u) 1 ® |i>) n to 
be searched is then applied to state \4>). Each O uv con- 
sists of a iSWAP gate followed by a Z(±n/2) rotation 
on each qubit, with the four possible sign combinations 
(-,-), (+,-) , (-,+) , and (+,+) corresponding to 
uv — 00, 01, 10, and 11, respectively. In the algorithm 
we use, as in Ref. |I10|. the encoding is a phase encod- 
ing. When applied to \<p), each oracle operator inverts 
the sign of the component corresponding to the state it 
tags, respectively to the other ones. The density matrix 
after applying the oracle ideally takes a simple form: the 
amplitude of all coefficients is 1/4, and the phase of an 
element p rs is ip rs = 7r(<5 rt + S s %), where t corresponds 
to the state tagged by the oracle operator. The state 
tomography performed after applying the oracle, shown 
in Fig. |3jb), is in good agreement with this prediction. 
More quantitatively, we find that after having applied the 
oracle operator, the intermediate fidelity is Fj nt = 0.87, 
0.80, 0.84, and 0.82, respectively. The last part of the 
algorithm consists in transforming the obtained state in 
the searched state irrespectively of it, or equivalently to 
transform the phase information distributed over the el- 
ements of the density matrix in a weight information 
with the whole weight on the searched state. This op- 
eration is readily performed by applying an iSWAP gate 
followed by Y(ir/2) rotations on each qubit. We find that 
the fidelity of the density matrix at the end of the algo- 
rithm is .Ffinai = 0.70, 0.62, 0.67, and 0.66 respectively. 
We explain both Fj nt and -Ffinai by gate errors at a 2% 
level, by errors in the tomography pulses at a 2% level, 
as well as by decoherence during the whole experimen- 
tal sequence (at the coupling point, relaxation times are 
T\ ~ 450 ns and T-J 1 ~ 500 ns, and the effective dephasing 
times ~ T^ 1 ~ 2 ps [13] )• -Ffinai is thus approximately 
the success probability one would obtain assuming no er- 
rors when reading the qubit register at the end of the 
algorithm. 

We now consider the success probability obtained after 
a single run (with no tomography pulses), which probes 
the quantum speed-up actually achieved by the proces- 
sor. We find (see Fig. [3]) that our processor does yield the 
correct answer with a success probability P$ — 0.67, 0.55, 
0.62, and 0.52 for the four basis states, which is smaller 
than the density matrix fidelity -Ffinai- One notices that 
the difference between Fa na \ and P§ , mostly due to read- 
out errors, slightly depends on the searched state: the 
larger the energy of the searched state, the larger the dif- 
ference. This dependence is well explained by the effect 
of relaxation during the readout pulse, which is the main 
error source at readout, the second one being readout 
crosstalk. One also notices that the outcome errors are 
distributed over all the wrong answers. To summarize, 
the error in the outcome of Grover's algorithm originate 



both from small unitary errors accumulated during the 
algorithm, and from decoherence during the whole se- 
quence, in particular during the final readout. 
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Figure 3: (a) Experimental sequence used for implement- 
ing the Grover search algorithm on four objects. First, 
Y(n/2) rotations are applied to produce the superposition 
\4>) — (1/2)53 v \ uv ) °f au t> as i s states; then one of the four 
possible oracle (corresponding to the four sign combinations of 
the Z rotations) is applied. The tagged state is then decoded 
in all cases using a iSWAP operation followed by Y(ir/2) ro- 
tations, (b) State tomography at two steps of the algorithm 
and success probability after a single run. The yellow dot on 
the left marks the basis state tagged by each oracle operator 
used. After applying the oracle, the information on the tagged 
state is encoded in the phase of six particular elements of the 
density matrix p. After decoding, the tagged state should be 
the only matrix element present in p. The amplitude of each 
matrix element is represented by a disk (black for the ideal 
density matrix, red for the measured one) and its phase by a 
radius (as well as a filling color for the ideal matrice). The 
probability distribution of the single-run readout outcomes is 
indicated on the right (yellow box for the correct answer, filled 
dark boxes for the wrong ones). 
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We now discuss the significance of the obtained re- 
sults in terms of quantum information processing. The 
achieved success probability is smaller than the theoreti- 
cally achievable value 1, but nevertheless sizeably larger 
than the value of 0.25 obtained by running once the clas- 
sical algorithm that consists in making a random trial. 
From the point of view of a user that would search which 
unknown oracle picked at random has been given to 
him. the fidelity of the algorithm outcome is F = 0.57, 
0.63, 0.57, and 0.59 for the 00, 01, 10, and 11 outcomes 
respectively, as explained in the Supplementary Infor- 
mation S5. Despite the presence of errors, this result 
demonstrates the quantum speed-up for Grover's algo- 
rithm when searching in a Hilbert space with small size 
N = 4. Demonstrating the speed-up for Grover's al- 
gorithm in larger Hilbert spaces requires a qubit architec- 
ture more scalable than the present one, which presently 
is a major challenge in the field. 

In conclusion, we have demonstrated the operation of 
the Grover search algorithm in a superconducting two- 
qubit processor with single-shot non destructive read- 
out. This result indicates that the quantum speed-up 
expected from quantum algorithms is within reach of su- 
perconducting quantum bit processors. 
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Supplementary material 

51. Sample preparation 

The sample is fabricated on a silicon chip oxidized over 50 nm. A 150 nm thick niobium layer is first deposited by 
magnetron sputtering and then dry-etched in a SF e plasma to pattern the readout resonators, the current lines for 
frequency tuning, and their ports. Finally, the transmon qubit, the coupling capacitance and the Josephson junctions 
of the resonators are fabricated by double-angle evaporation of aluminum through a shadow mask patterned by 
e-beam lithography. The first layer of aluminum is oxidized in a Ar — O2 mixture to form the oxide barrier of the 
junctions. The chip is glued with wax on a printed circuit board (PCB) and wire bonded to it. The PCB is then 
screwed in a copper box anchored to the cold plate of a dilution refrigerator. 

52. Sample parameters 

The sample is first characterized by spectroscopy (see Fig. l.b of main text). The incident power used is high 
enough to observe the resonator frequency i/r, the qubit line is i, and the two-photon transition at frequency ^02/2 
between the ground and second excited states of each transmon (data not shown). A fit of the transmon model 
to the data yields the sample parameters E]/h = 36.2 GHz, E l G /h = 0.98 GHz, d/ = 0.2, Ef/h = 43.1GHz, 
Eq/H = 0.87 GHz, du = 0.35, = 6.84 GHz, and i/pf = 6.70 GHz. The qubit-readout anticrossing at v — i/r yields 
the qubit-readout couplings g\ ~ ~ (27r)50MHz. Independent measurements of the resonator dynamics (data 
not shown) yield quality factors Q\ = Qu = 730 and Kerr non linearities [13,(1]] Ki/v l K ~ Kn/vg ^ -2.3±0.5 x 10~ 5 . 

53. Experimental setup 

• Qubit resonant microwave pulses: The qubit drive pulses are generated by two phase-locked microwave generators 
feeding a pair of I/Q-mixers. The IF inputs are provided by a 4-Channell GS/s arbitrary waveform generator 
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(AWG Tektronix AWG5014). Single-sideband mixing in the frequency range of 50-300 MHz is used to generate 
multi-tone drive pulses and to obtain a high ON/OFF ratio (> 50 dB). Phase and amplitude errors are corrected 
by applying suitable sideband and carrier frequency dependent corrections to the amplitude and offset of the IF 
signals. 

• Qubit frequency control: Flux control pulses are generated by a second AWG and sent to the chip through a 
transmission line equipped with 40 dB total attenuation and a pair of 1 GHz dissipative low-pass filters at 4 K. 
The input signal of each flux line is returned to room temperature through an identical transmission line and 
measured, which allows to compensate the non-ideal frequency response of the line. 

• Readout pulses: The driving pulses for the Josephson bifurcation amplifier (JBA) readouts are generated by 
mixing the continuous signals of a pair of microwave generators with IF pulses provided by a 1 GS / s arbitrary 
waveform generator (AWG Tektronix AWG5014). Each readout pulse consists of a measurement part with a 
rise time of 30 ns and a hold time of 100 ns, followed by a 2 /is long latching part at 90 % of the pulse height. 

• Drive and measurement lines: The drive and readout microwave signals of each qubit arc combined and sent to 
the sample through a pair of transmission lines with total attenuation 70 dB and filtered at 4K and 300 mK. 
A microwave circulator at 20 mK protects the chip from the amplifier noise. The signals are amplified by 36 dB 
at 4 K by two cryogenic HEMT amplifiers (CIT Cryo 1) with noise temperature 5K. The reflected readout 
pulses are amplified and demodulated at room temperature. The IQ quadratures of the demodulated signals 
are sampled at 1 GS/s by a 4-channel data acquisition system (Acqiris DC282). 

S4- Readout Errors 

Errors in our readout scheme are discussed in detail in Ref. [13] for a single qubit. First, incorrect mapping |0) — » 1 
or|l) — > of the projected state of the qubit to the dynamical state of the resonator can occur, due to the stochastic 
nature of the switching between the two dynamical states. As shown in Fig. S4.1, the probability p to obtain the 
outcome 1 varies continuously from to 1 over a certain range of drive power Pa applied to the readout. When the 
shift in power between the two J?|o),|i) (Pd) curves is not much larger than this range, the two curves overlap and errors 
are significant even at the optimal drive power where the difference in p is maximum. Second, even in the case of 
non overlapping P|o).|i) (-Pd) curves, the qubit initially projected in statejl) can relax down to |0) before the end of 
the measurement, yielding an outcome instead of 1. The probability of these two types of errors vary in opposite 
directions as a function of the frequency detuning A = z/r — v > between the resonator and the qubit, so that a 
compromise has to be found for A. As explained in the main text, we use a shelving method to the second excited 
state in order to improve the readout contrast c = Max (p\i) — P\o))i with a microwave tt pulse at frequency 1/12 
bringing state |1) into state |2) just before the readout pulse. The smallest errors ej,' 11 and e*' 11 when reading |0) and 
|1) are found for Ai = 440 MHz and A n = 575 MHz: e\ = 5% and e\ = 13% (contrast ci = 1 - ej> - e\ = 82%), and 
4 1 = 5.5% and ef = 12% (c n = 82%). When using the |1) -> |2) shelving before readout, ef, = 2.5% and e\ = 9.5% 
(contrast c\ == 1 — e\ — e\ = 88%), and e}} = 3% and e\ x = 8% (cn = 89%). These best results are very close to 
those obtained in [12], but cannot however be exploited for simultaneous readout of the two qubits. 

Indeed, when the two qubits are measured simultaneously, we find an influence of the projected state of each qubit 
on the outcome of the readout of the other one. In order to to minimize this spurious effect, we increase the detuning 
Ai.n up to 1 GHz with respect to previous optimal values. An immediate consequence shown in Fig. S4.1(a) is 
a reduction of the ciji contrasts. The errors when reading |0) and |1) are then e\ = 10% and e\ = 16% (contrast 
c\ = 74%) and ej, 1 = 12% and e] 1 = 15% (contrast cjj = 73%). When shelving the qubit in state |2) , the errors are 
eQ = 5%, e\ — 11% (contrast c\ = 84%), ef, 1 = 5%, e\ l — 12% (contrast c\ — 83%). The readout errors are captured 
in the 4x4 readout matrix 1Z shown in Fig. S4.1(c), that gives the probabilities p uv of the four possible outcomes for 
the different input states using the |1) — > |2) shelving technique. This matrix TZ is used to correct the readout errors 
only when doing state tomography, and not when the running the algorithm once. The cause of the readout crosstalk 
in our processor is discussed in Ref. [11]. 

S5. Algorithm Fidelity 

The fidelity of each possible outcome ab £ {00, 01, 10, 11} of our algorithm is given as 

Jab =Pab/|ab>/ (Pab/|00) + Pab/|01) + Pab/|10) + Pab/|ll)) , 

where p a b/| U v) is the conditional probability for obtaining ab when the state \uv) has been marked by the oracle O uv . 
Table |l] shows these probabilities p a b/| U v) f° r ai l possible combinations of ab and \uv) as well as the fidelities / a b. The 
average fidelity of the algorithm is 59.1 %. 
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Figure 4: 

(a) Switching probability p of each readout as a function of its peak driving power, when its qubit is prepared in state |0) 
(blue), |1) (red), or |2) (brown), with the other qubit being far detuned. The arrows indicate the readout errors where the 
contrast is optimal with (brown) and without (red) |1) — > |2) shelving, (b) Readout matrix giving the probabilities of the four 
ab outcomes, for the four computational input states \uv), when using |1) — > |2) shelving. 
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Table I: Conditional probabilities p a b/\ U v) an d statistical fidelities f a b for all possible outcomes ab, measured for our version of 
Grover's algorithm. 
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